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Abstract 



This paper develops upper and lower bounds for the probabihty of Boolean expressions by treating multiple occurrences of variables 
as independent and assigning them new individual probabilities. Our technique generalizes and extends the underlying idea of a 
number of recent approaches which are varyingly called node splitting, variable renaming, variable splitting, or dissociation for 
probabilistic databases. We prove that the probabilities we assign to new variables are the best possible in some sense. 
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>^ Highlights. • We give upper and lower bounds to the prob- 
, ^ ability of Boolean expressions. • We generalize and extend re- 
cent related approaches. • We show the main results in this 
paper are best possible in some sense. 
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1 1. Introduction 

Several recent papers propose to approximate an intractable 
counting problem with a tractable relaxed version by treating 
multiple occurrences of variables, nodes, or tuples as indepen- 
dent, or ignoring some constraints. Choi et al. El approximate 
inference in Bayesian networks by node splitting, i.e. remov- 
ing some dependencies from the original model, and show how 
the technique subsumes mini-bucket elimination Q- Ramirez 
and Geffner [3 1 treat the problem of obtaining a minimum cost 
satisfying assignment of a CNF formula by variable renam- 
ing, i.e. replacing a variable that appears in many clauses by 
many fresh new variables that appear in few. Pipatsrisawat and 
Darwiche |4| provide lower bounds for MaxSAT by variable 
splitting, i.e. compiling a relaxation of the original CNF. An- 
dersen et al. |5| relax constraint satisfaction problems by re- 
finement through node splitting, i.e. ignoring some interactions 
between variables. In our recent work ||6|, we develop a tech- 
nique called dissociation to approximate the ranking of answers 
to intractable conjunctive queries. 

In this paper, we study the probabilities of Boolean expres- 
sion after treating some occurrences of variables as indepen- 
dent, and assigning them new individual probability values. We 
call this approach dissociation. It turns out that the unifying 
idea of the above papers is to provide either lower bounds for 
conjunctive expressions, or upper bounds for disjunctive ex- 
pressions by assigning dissociated variables their original prob- 
abilities. We show that these bounds can be understood as duals 



and give new and non-obvious upper bounds for conjunctive 
and lower bounds for disjunctive expression (see |Fig. l| l 
further show that those are optimal in some sense. 



We 



We start with some necessary notations (Sect. 2i and defi 



nitions ( Sect. 3 i, provide our main results ( Sect. 4 1, treat binary 



dissociation separately ( Sect. 5 1, illustrate the results with ex- 
amples ( [Sect. 6| l, and give the full proofs in the appendix. 

2. Notational and Mathematical Background 

We use [k] as short notation for {1, ... ,k}, write x, as short 
notation for x,-, / e [k] if k is clear from the context or not rele- 
vant, use the bar sign for the complement of an event or proba- 
bility (e.g., X - -ijc, and p - 1 - p), and use a bold notation for 
sets or vectors of variables (e.g., x = (jci , . . . , Xk)) alike. Proba- 
bilities are always assumed to be between and 1 . 

Our treatment of Boolean expressions is notably inspired 
by Crama and Hammer Q, and by Fuhr and Roll eke fE\. We 
assume a set x - {xj , . . . , x^} of independent Boolean random 
variables, and assign to each variable x, a primitive event (we do 
not formally distinguish between the variable x, and the event 
X, that it is true) which is true with probability P[x,] = pi. 
Thus, all primitive events are assumed to be independent (e.g., 
P[xiX2] = PiPi)- We are interested in computing the proba- 
bilities of composed events, i.e. event expressions (p which are 
logically composed of primitive events. 

In our formalism, we make further use of a set A of complex 
events, which are composed from primitive events. It is known 
that arbitrary correlations can be represented with event expres- 
sions starting from independent Boolean random variables only 
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(see Appendix A i. The correlation p{A, B) between Boolean 
events A and B is defined as p{A,B) = [91, with co- 

variance cov(A, B) - P[AB] - F[A]P[B] and variance var(A) - 
P[A] - (P[A])^. Hence, complex events can be arbitrarily cor- 
related (e.g., < F[AB] < P[A], or equally -1 < p(A, B) < 1). 

We write 0(x) to indicate that x is a set of primitive events 
appearing in the expression Whenever we write (^(x. A), 
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we imply that a set x of primitive events and a set A of com- 
plex events are both appearing in (p, and that x are indepen- 
dent of A, i.e. the complex events A are composed of primitive 
events different from x. For example, ^{x, A) may be defined 
as (p{{x], {A, B]) - xAy xB, and the complex events A and B as 
A := yiyo and B := yiy2 over y = (yuyi)- 

The dual of a Boolean expression is obtained by exchang- 
ing the operators V and A, as well as the constants and 1 Q 
Def 1.8]. The duality principle states that if a Boolean ex- 
pression is valid, then so is its dual [7, Th. 4.4]. It plays an 
important role in our paper The two dual De Morgan 's laws Q 
Th. 1.1.10] state that -.(A V B) = AB and -.(AB) = A V B. With 
absorption, we refer to the two dual identities E] Th. 1.1.11]: 

AV B ^ ABW B 
AB = (A V B)B . 

The (disjunctive) inclusion-exclusion principle |10, R. 11.8.1] 
and its less-known conjunctive dual state: 

P[A V B] = P[A] + P[B] - P[AB] (1) 
P[AB] = P[A] + P[B] - P[A V B] (2) 

From absorption and two special inclusion-exclusion cases: 

P[A V B] = P[A] + P[B] (if P[AB] = 0) 

P[AB] = P[A] + P[B] - 1 (if P[A V B] = 1) 

we get the following dual rules, which we call event splitting: 



P[A V B] = P[AB] + P[B] 

P[AB] = P[A V B] -H P[B] - 1 

3. Dissociation and Statically-tight Bounds 



(3) 
(4) 



In this paper, we are interested in statically-tight bounds for 
dissociated expressions. In this section, we define and illus- 
trate these two concepts. Intuitively, a dissociation (p' of an 
expression ^ is derived by treating multiple appearances of the 
same variable as independent, and assigning them individual 
new probabilities. We are then interested in assigning prob- 
abilities to these new variables so that the probability of the 
dissociated expression P[0'] is always either an upper or lower 
bound for P{4>\- Furthermore, we want to assign such probabil- 
ities which (i) can become tight, and which (ii) guarantee the 
best bounds possible when ignoring all the other variables. We 
call such bounds statically -tight. 

Definition 3.1 (Dissociation). A dissociation of a Boolean ex- 
pression cf)(x. A) is a new expression 0'(x', A) so that there exists 
a substitution : x' — > x that transforms the new into the origi- 
nal expression: ^'(0(x' ,A)) — 0(x, A). The probability P[(p'] is 
evaluated by assigning each new variable x'. e x' independently 
a new probability p\. 

Example 3.2 (Dissociation). Take the two DNF expressions: 

(p(x) — (f>(xi,X2,Xj,,X4) — XlXj V X1X4 V X2X4 
(p'ix') = (p'{X\,X2,X^,x\,x'^) = V x\x\ V X2x'l . 



Then 4>'{x') is a dissociation of (f>(x), as (f){x) — cf)'(6{x')) for 
the substitution - {(xi, xi), (x2, X2), (x^, x^), (x'^, x^), (x'^,X4)}. 
Furthermore, assigning x'^ and x'^ the same probability as X4 
(i.e. p'^ — p'4 — Pa) makes P[0'(x')] an upper bound to P[0(x)]. 
This follows from P[(/i(x)] = PxPt, + P\Pa + PiPa - PiPiPA - 
P\P3P2P4 - PiPaPi + P\P3P4P2, whereas P[0'(x')] = PiPi + 
P\P4 + P2P4 - P1P3P4 - P1P3P2P4 - P\pIp2 + P\P3pIp2, and 

thus P[0'(x')] - P[0(x)] = {p\P2P3P4 - P\P2P4)(P4 - 1) > 0. 

Next consider the two DNF expressions 

i^(y) — >J/(xi,X2,X4) — X1X3 V X\X4 V X1X4 
i/''(y') - i/^'(xi,X2,x\,x^) - X1X2 V xix'4 V X1X4 . 

Then if/'iy') is a dissociation o/i/'(y), as ipiy) — il/'iOiy')) for the 
substitution 6 — {{xy, X\),{xi, x^),{x'^, X4), {x'^,X4)]. Assigning 
again p'^ — p'4 — P4, gives both expressions the same probabil- 
ity PWly')] = /?i/?3 +P4- P\P-iP4 = P['A(y)]- 

Both of above dissociations follow a more general template 

w(z, A) = ll>{\x4\, {Aq, A], A2)) = Ao V A1X4 V A2X4 
w'(z',A) = (jj'{{x\,x'1],{Aq,Ai,A2]) = Ao V A1.X4 V A2JC4' 

with A, representing the following composed events: Aq = X1X3 
andAi = xi for both, A2 — X2for cp, andA2 — xyfortp. to'(y') is 
a dissociation of Lj(y), as co(y) — to'(0(y'))for the substitution 
- {(x'^,X4), (x'^,X4)}. As we show in this paper, the probabil- 
ity of the dissociation P[w'(z', A)] is always an upper bound to 
P[w(z, A)] irrespective of what expressions are substituted for 
A, and as long as they are independent of X4, and as long as 
p'4 — p'4 — P4- Also, for some expressions A,-, those bounds 
actually become tight (whenever Ai and A2 are identical). Fur- 
thermore, we cannot find values for p'^ and p'^ which give better 
bounds for all possible A. We call such bounds statically-tight. 



Example 3.2 informally introduces the idea of statically- 
tight bounds for dissociated expressions. Intuitively, we are 
interested in bounding the probability of an event expression 
0(x, A) with another event expression i^(x', A), where ^ and if/ 
use the same complex events A with unknown probabilities and 
correlations, but different primitive events x and x' with speci- 
fied probabilities. In particular, we are interested in "the best" 
probability assignments p' to x' of \p (i.e. those values that give 
the tightest bounds) without knowing the probabilities of and 
correlations between events A. We call such bounds statically- 
tight and define them as follows for the upper case: 

Definition 3.3 (Upper bound). Given an event expression ^{x, A) 
with p = P[x]. Another event expression ip(x',A) with p' = 
P[x'] is an upper bound of(f> iff 

VA : P[0(x,A)] < P[i//(x',A)] . 

Definition 3.4 (Statically-tight upper bound). An upper bound 
becomes statically-tight iff: 
(i) 

3A : P[0(x, A)] = P[i/'(x', A)] , 
where A must not be trivial, i.e. P[A,] and P[A/] ^ 1. 
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(ii) 

Vp" = P[x"].3A : P[(/r(x',A)] < P[i//(x",A)] , 

where p" p', i.e. there are no other probability assign- 
ments for 'a' which give tighter bounds. 

Lower bounds are defined analogously. We use the name 
statically-tight in order to distinguish from dynamically-tight 
bounds, i.e. assignments of probabilities that are optimal with 
regard to a particular probability assignment to all variables.' 
Static assignments are made without knowledge of the proba- 
bilities of the events A, nor their mutual correlations. The rea- 
son for this restriction is that finding the best approximation 
for a particular instance may often be NP-hard itself. Restrict- 
ing oneself to the best possible approximation that can be done 
without analyzing the dependencies of the non-dissociated vari- 
ables, i.e. by just considering the syntactic expressions during 
dissociation, can give very fast algorithms |6|. 

What we show in this paper is that dissociated expressions 
together with appropriate probability assignments to the newly 
dissociated variables can lead to upper and lower bounds for 
the original expressions. Furthermore, we show that there are 
probability assignments which are statically-tight. 

4. Statically-tight Bounds for Dissociated Expressions 

This section states the main results of this paper. 

Theorem 4.1 (Disjunctive Dissociation). Let x be a Boolean 
random variable with probability p and Aq, . . . ,A„ arbitrary 
Boolean events independent of x. Then the probability P[0j] of 
the disjunctive expression 

(p^ - AqV xA\ V xA^ V ... V xAn 

can be upper and lower bounded by the probability P[<^y of its 
dissociation 

0^ = Ao V XxAx V X2A2 V ... V XnAn 

with Xi as new independent random variables, and 

(a) Pi > p for upper bounds; and 

(b) Pi < p, s.t. 11/(1 - pi)> 1 - p for lower bounds. 
Furthermore, P[(/i^] becomes a statically-tight upper bound for 
Pi = p, and a statically-tight lower bound for Pi < p, s.t. Y\i(^~ 
Pi) = ^ ~ P- Requiring all pi to be the same, the symmetric 
statically-tight lower bound results from /?, = !- -^/l - p. 

Theorem 4.2 (Conjunctive Dissociation). Let x be a Boolean 
random variable with probability p and Aq, . . . ,A„ arbitrary 



'Note that this distinction is analogous to query-centric vs. data-centric 
rewriting of queries in databases. Query-centric algorithms only consider the 
syntactic query expression, whereas data-centric algorithms can also make use 
of particulaiities in the data instance. The query-centric technique that moti- 
vates this paper is dissociation for probabilistic databases 161, which approxi- 
mates the probability scores of queries over probabilistic without looking at the 
data-instance. Analogously, statically-tight bounds give the best bounds that 
can be given without evaluating the actual expressions represented by A, . 
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Figure 1: Symmetric probability assignments for a disjunctive or a conjunctive 
dissociation to become a statically-tight upper or lower bound. 



Boolean events independent of x. Then the probability P[0c] of 
the conjunctive expression 

0f = Ao A (x V Ai) A (x V A2) A . . . A (x V A„) 

can be upper and lower bounded by the probability P\(p'c\ of its 
dissociation 

0; = Ao A (xi V Ai) A (X2 V A2) A . . . A (x„ V A„) 

with Xi as new independent random variables, and 

(a) Pi > p, s.t. Wi Pi ^ P for upper bounds; and 

(b) Pi < p for lower bounds. 

Furthermore, P[(^^] becomes a statically-tight upper bound for 
Pi > p, s.t. Yli Pi ^ P' <^nd a statically-tight lower bound 
for Pi — p. Requiring all pi to be the same, the symmetric 
statically-tight upper bound results from pi — ifp. 



Figure 1 gives an overview of the symmetric statically-tight 
bounds for dissociated expressions. Remember that statically- 
tight implies that one cannot find probability assignments for 
X, in <f)' that result in generally tighter bounds without knowing 
the probabilities of and the mutual correlations between A,-, and 
note that symmetric implies that all are the same. 

5. Binary Dissociation Bounds 

This section formulates and proves our bounds for the case 
when a single variable is dissociated into two new independent 
variables. The general case then simply follows from induction 
on the number of new variables. 

Proposition 5.1 (Upper Disjunctive Dissociation). Let x be a 
random variable with probability p, and A, B, C be Boolean 
events independent of x. The probability of 

cf>4 - xAw xBW C 

can then be upper bounded by the probability of 

4>"j - xiA V X2B V C 

with Xi as new random variables with pi > p. Furthermore, 
Pi — p is a statically tight upper bound. 



Proof By splitting on event C (Eq. 3 1, we get 



P[0rf] = P[xAC V xBC] + P[C] 
P[02] = ^[xiAC V xiBC] + P[C] 
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Hence, the proof follows from comparing the probabilities of 

if/d - xD V xE 
4/"j = xiD V X2E 

with D :- AC and E := BC as new events. From disjunctive 



inclusion-exclusion ( Eq. 1 



I, we get 

P[xD] + P[xE] - P[xDE] 
= pP[D] + pP[E] - pP[DE] 
P^] = nxiD] + P[X2E] - P[xiX2DE] 
^piP[D]+p2P[E]-pip2P[DE]. 

Hence, P[iJ/"J > P[iJ/dl and also P[4>"^] > P[(pd] iff 

A = (pi - p)P[D] + (p2 - p)P[E] + (p - piP2)P[DE] > . 

This is guaranteed to hold if > p. From monotonicity of A in 
pi and p2 follows that the smallest such bound is given for p, = 
p. Furthermore, this bound is tight in case that events A and B 
are disjoint and thus P[DE] - P[ABC] = 0. Since we assume 
lack of knowledge of the probabilities of and the correlations 
between A and B, the latter bound is statically-tight. □ 

Proposition 5.2 (Upper Conjunctive dissociation). Let x be a 
random variable with probability p, and A, B,C be Boolean 
events independent of x. The probability of 

(pc - (xW A) AixW B) AC 

can then be upper bounded by the probability of 

(f)" = (xi V A) A (X2 V B) A C 

with Xi as new random variables and choosing pi > p, s.t. 
P1P2 ^ P- Furthermore, choosing pi > p, s.t. pip2 — p re- 
sults in a statically tight upper bound (e.g. by setting pi — yfp). 



Proof. By splitting on event C ( Eq. 4 1, we get 



P[<Pc] = P[(-^ V AV C) A(xV BV C)]+ P[C] - 1 
P[<p"] = P[(xi V A V C) A (X2 V B V C)] -H P[C] - 1 . 

Hence, the proof follows from comparing the probabilities of 

i//c - (xW D) A (xW E) 

lA" = (Xi V D) A (X2 V E) 

with D := A V C and £ := B V C as new events. From conjunc- 



tive inclusion-exclusion ( Eq. 2 1 and subsequent event splitting 



( Eq. 3 1 on D, E, and Dw E,we get 

P[i//^] = P[x W D]+ P[x V E]-P[xV DV E] 

= P[xD] + P[D] + P[xE] + P[E] - P[xDE] - P[D V E] 
= pP[D] + pP[E] - pP[DE] + P[D] + P[E] - P[D V E] 

P[C] = P[^i V £)] -H P[X2 V £] - P[xi V X2 V D V £■] 
= P{xiD] + P{D] + P{X2E] + P{E] 

- P[{xi V X2)DE] - P[D V E] 
= pxP[D] + p2P[E] - ipi +P2- piP2)P[DE] + 5 . 



Hence, P[ijj'^] > Piipc], and also P[0;,'] > P[0c] iff 

A = (pi - p){P[D] - P[DE]) + (P2 - P)(P[£] - P[DE]) 

+ (piP2 - pmOE] > . 

This is guaranteed to hold if pj > p and piP2 > P- Since A 
is monotone in the smallest such bounds result from setting 
Pi > p, s.t. p\p2 = P- Furthermore, these bounds are tight 
if A and B are identical, and hence P[D] = P[E] = P[DE]. 
Since we assume lack of knowledge of the probabilities of and 
correlations between A and B, these latter bounds are statically- 
tight. The symmetric statically-tight bound results from setting 
p\ = p2, and hence p, = ^p. □ 

Proposition 5.3 (Lower Disjunctive Dissociation). Let x be a 
random variable with probability p, and A, B, C be Boolean 
events independent of x. The probability of 

(pd - xAV xBV C 

can then be lower bounded by the probability of 

0^ - XyA V X2B V C 

with Xi as new random variables and choosing pi < p, s.t. 
(1 — p\){l — P2) > I - p. Furthermore, choosing pi < p, s.t. 
(1 — pi)(l — P2) — ^— P results in a statically-tight lower bound 
(e.g. by setting pi - \ - ^Jl-p). 

Proof. Using De Morgan's laws, we can write the complements 
of (pd and (p'^ as: 

(pd - (xW A) AixW B) AC 
ip'j - (x, V A) A (x2 V B) A C 

Treating the complements x, X\, X2,A, B,C, D - AvC, and E - 



B V C as new events, and applying Prop. 5.2 we know that 
P[0j] > P[(pd] if Pi ^ P and pip2 > P- Hence, it follows that 
P[(p'j < P[(pd] if Pi < p and (1 - pi)(l- P2) > 1 - P, and that the 
largest such bounds result from setting < p, s.t. (1 - /:>i)(l - 
P2) - \ - p. From the proof of Prop. 5.2 it further follows 



that these latter bounds become tight if A and B are identical, 
and hence P[Z)] - P[£'] = P[DE]. Since we assume lack of 
knowledge of the probabilities of and correlations between A 
and B, these latter bounds are statically-tight. The symmetric 
statically-tight bound results from setting pi - p2, and hence 
Pi = 1 - a/I - P- 



□ 



Proposition 5.4 (Lower Conjunctive Dissociation). Let x be 
a random variable with probability p, and A, B, C be Boolean 
events independent of x. The probability of 

(pc - (x W A) A (x W B) A C 

can then be lower bounded by the probability of 

(p[ = (xi V A) A (X2 V B) A C 

with Xi as new random variables with pi < p. Furthermore, 
Pi — p is a statically tight lower bound. 
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Proof. Using De Morgan's laws, we can write the complements 
of (pc and f*' 



as: 



be - xAv xBV C 
b[. - XyA V X2B V C 



Treating the complements x, x i,X2,A, B, C, D = AC, and E = 
BC as new events, and applying Prop. 5.1 we know that P[0' ] > 



P[^c] if Pi > P- Hence, it follows that < P[(f>c] if Pi < p, 
and that the largest such bound results from pi - p. From the 
proof of |Prop. 5.1| it further follows that this bound is tight if 
A and B are disjoint (i.e. P[A V B = 1]) and hence V[DE] = 
V[ABC] - 0. Since we assume lack of knowledge of the prob- 
abilities of and correlations between D and E, this bound is 
statically-tight. □ 



6. Illustration 

We illustrate our symmetric statically-tight bounds for the 
disjunctive expression ^4 - xAy xB and the conjunctive expres- 
sion 0c = (jc V A)(x V B). Let p = P[x], q = P[A] = P[B], and 
assume x to be independent of A and B, which can be arbitrarily 
correlated: -1 < p{A,B) < 1. Further, let /?, - P[xi] = P[x2] 
be the probabilities in the dissociated expressions. 

In a few steps, one can calculate the probabilities of 0^, (p^ 
and their dissociations as 



P[0rf] = 2pq - pP[AB] 
P[0;,] = Ipiq - p?V{AB] 
Vm^P + d-pWB] 

■ 2piq + pi\l - 2q) + (1 



■ Pi^)P[AB] 



[Figure 2| illustrates the probabilities of the expressions and their 
symmetric statically-tight upper and lower dissociations for var- 
ious values of p, q and as function of the correlation p{A, B), 
and by setting according to Fig. 1 Remember that p(A, B) - 

P[AB]- 



q-q- 

V[AB] 



and, hence: P[AB] = p(A, B) ■ (q- q^) + q-^. Further, 
= (i.e. disjointness between A and B) is not possible for 
q > 0.5, and from P[A V B] < 1, one can derive P[AB] >2p-l. 
In turn, p = -1 is not possible for q < 0.5, and it must hold 
P[AB] > 0. From both together, one can derive the condition 

Pit 



= max(-j^,-ii£^). 



This marks the beginning of 



the graphs for different values of q in the figure. 
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Appendix A. Representing Complex Events 

It is known from Poole's independent choice logic fTT) that 
arbitrary correlations between events can be composed from 
disjoint-independent events only. A disjoint-independent event 
is represented by a non-Boolean independent random variable 
X which takes either of k values v = (vi , . . . , v^) with respective 
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probabilities p - (pi,. . ., pi^) and 2,- Pi - I. Poole writes such 
a "disjoint declaration" as x([vi: pi, . . . ,vic: pic]). 

In turn, any k disjoint events can be represented starting 
from k - 1 independent Boolean variables y = {yu - ■ ■ ,yii-\) 



Proof Theorem 4.2 (a) The proof follows from Prop. 5.2 and 



and probabilities P[y] - (pi 



P2 



P\ ' P\P2 ' ■ ■ ■ ' P\-Pk-- 

the disjoint-independent event variable x its value v, whenever 
event A, is true with A, defined as: 



Pk- 



;■), by assigning 



{x = vi) = Ai := xi 
{x - V2) = A2 := xxX2 

(x = Vk-i) = Ai, := xi . . . Xk^iXk-x 
(x = Vk) = Ai := xi . . . Xk-2Xk-i . 

For example, a primitive disjoint-independent event variable 



x{v\ : ^, vi : 5, vi : ^, V] : ji;) can be represented with three inde- 
pendent Boolean variables y = {yi,y2,y-i) and P[y] - ( j, g, j). 

It follows that arbitrary correlations between events can 
be modeled starting from independent Boolean random vari- 
ables alone. For example, two complex events A and B with 



P[A] — P[B] = q and varying correlation (see Sect. 61 can 
be represented as composed events A :- yiy2 V ^3 V yu and 
B '■- y\y2 V ^3 V ^5 over the primitive events y with vary- 
ing probabilities P[y]. Events A and B become identical for 
P[y] = (0,0,^,0,0), independent for P[y] = (0,0,0,^,^), and 
disjoint for P[y] = (0.5, q, 0, 0, 0) with ^ < 0.5. 

Appendix B. Proofs of Main Theorems 



ProoJ ^Theorem 4.1 (a)\ The proof follows from Prop. 5.1 and 
induction on the number of new variables for each original dis- 
sociated variable. Assume we know that P[0^] > V[4>d\ for 

= A() V xAi V . . . V xA„ = Aq V x(Ai V ■ ■ ■ V A„) 



= An V XiAi V 



V Xk-iAk-i V Xk{Ak V ■ ■ ■ V A„) 



with pi > p,i e [k] and k < n. We need to show that then 
P[<^J+'] > P[(^c], with Pi > p,i e[k+l] for 



: Aq V XiAi V ■ ■ ■ V XkAk V Xk+i(Ak+i V ■ ■ ■ V A„) 



Applying Prop. 5.1 to 0*, we know P[(^* ] > P[0^] for 



<^^' = Ao V xiAi V ■ ■ ■ V Xk-iAk-i V x'^Ak V x;'(Ai+i V ■ ■ ■ V A„) 

with p'l^ > pk > p and p'^ > pk > p. Hence, we have shown 
P[<^^+i] > P[4] > P[0rf] after substituting and pk^i ^ 

p^'. Since 0^ is monotone in p,, it follows that the smallest such 
upper bound results from choosing /?,■ = p. Furthermore, this 
bound is tight in case the events A,-, / e [n] are disjoint, since 
then F[<p''] = P[Ao] + pP[AiAo] + ■■■+ pF[A„Ao] = F[4>d]. 
Since we assume lack of knowledge of the probabilities of and 
correlations between events A,, the bound is statically tight. □ 



induction on the number of new variables for each original dis- 
sociated variable. Assume we know that P[0j] > P[(pc] for 

Cpc - Aq A (xVAi) A ■ ■ ■ A (xVA„) = Ao A (xVAi ■ ■ ■ A„) 
0^ = Ao A (xi vAi) A ■ ■ ■ A (xk-i wAk-i) A (xkVAk ■ ■ ■ A„) 

with Pi > p, s.t. Y[i£[k] Pi ^ ™d k < n. We need to show that 
then P[<^*+i] > P[0J, with pi > p, s.t. Uie[k+i] Pi > P for 

0*+' = Ao A (xi VAi) A ■ ■ ■ A (xkVAk) A (Xk+i WAk+i 



■■A„) 



Applying [Prop. 5.2|to 0*, we know P[0*'] > P[0*] for 



0*=AoA(xiVAi)A---A (Xk- 1 V Ai_ 1 ) A (x^ V Ai) A (Xi V Ai+ 1 ■ ■ -A,,) 

with p[ > Pk > p, p'l > Pk > p, and p[p'^ >pk>P- Hence, it 
follows that p\ - ■ ■ Pk-ip'kP'k ^ P\ ■ ■ ■ Pk ^ P- Hence, we have 
shown P[<^c^'] > P[(f>c] > P[4>c] after substituting pk <- p'f, 
and pk+i «— p'l!- Since 0^ is monotone in it follows that 
the smallest such upper bounds result from choosing pi > p, 
S t- Y\ie[n] Pi - P- Furthermore, these bounds are tight in case 
A,, / e [n] are identical, since then P[0^,] = P[(xi ■ ■ ■ x„)(Ai V 
Ao] + P[Ao] - 1 = pP[Ai V Aq] + P[Ao] - 1 = P[«^c] ■ Since we 
assume lack of knowledge of the probabilities of and correla- 
tions between events A,, these latter bounds are statically-tight. 
The symmetric statically-tight bound results from additionally 
requiring = pj, and thus setting pi - □ 



Proof Theorem 4.1 (b) Using De Morgan's laws, we can write 
the complements of (pij and 0^ as: 

= Aq a (xVAi) A ■ ■ ■ A (xVA„) 
= Ao A (xi V Ai) A ■ ■ ■ A (x„ VA„) 

From 
s.t. 



Theorem 4.2 (a) we know that P[0^] > P[(f>d] if Pi ^ P, 



\iPi > p. Hence, it follows that P[(f>'J < P[(f>d] if Pi < 
p, s.t. ]"[/(! ~ Pi) ^ 1 ~ and that the smallest such bounds 
result from p,- < p, s.t. Y\ii^ ~ Pi) - ^ ~ P- Furthermore, these 
later bounds are tight in case A,-, ; G [«] are identical, since then 
PWa\ = P[(^i V ■ ■ ■ V x„)AiAo] + P[Ao] - 1 = pP[AiAo] + 
P[Ao] - 1 = P{4>d]- Since we assume lack of knowledge of 
the probabilities of and correlations between events A,-, these 
latter bounds are statically-tight. The symmetric statically-tight 
lower bound results from additionally requiring - pj, and 
thus setting /?, = !- -^1 - p. □ 



ProoJ ^Theorem 4.2 (b)\ Using De Morgan's laws, we can write 
the complements of (f>c and as: 

- AqV xA[ V . . .V xAn 

^;=AoVxiAi V...Vx„A„ 



From Theorem 4.1 (a) we know that P[(f>'J > P[(pc] if Pi > P- 
Hence, it follows that P[0^.] < P[0c] if p, < p, and that the 
smallest such bound results from setting p, = p. Furthermore 
this latter bound is tight in case A,, / € [n] are disjoint (this is 
trueifP[A, vAj] = 1), since then P[0;^] ^P[Ao]+(l-p)P[Ai] + 
■ ■ ■ + (l-p)P[A„] = P[^c]- Since we assume lack of knowledge 
of the probabilities of and correlations between events A,, this 
bound is statically-tight. □ 
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Appendix C. Nomenclature 



X, y, z primitive events: independent Boolean random variables 

(/>, i//, o) composed events: Boolean event expressions 

A,B,. . . complex events: events composed from primitive events 

with unknown event expressions 
P[A], P[0] probability of an event or expression 
Pi probability P[x,] of random variable X; 

X, A, p ordered sets (x[, . . . , xt) or unordered sets [xi,..., Xk) of 

variables, events or probabilities 
x,A, 0, p complements: -'X, -lA, \ - p 
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